PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID CA02g18040
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HD-ZIP
Protein Properties Length: 736aa    MW: 81010.6 Da    PI: 5.9347
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
CA02g18040       genome    PEP       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      Homeobox    58.8     8.8e-19    63       118    1            56
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 +++ +++t++q++e+e++F+++++p+ ++r+eL ++l+L   qVk+WFqN+R+++k
  CA02g18040  63 KKRYHRHTQHQIQEMEAFFKECPHPDDKQRKELGRRLELAPLQVKFWFQNKRTQMK 118
                 688999***********************************************998 PP

2      START       200.1    9.7e-63    251      472    1            206
                 HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
       START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                 ela +a++el ++a+ +ep+W + s      + ++e+ ++f+++ +       ++ea+ras vv+m++ +lve+l+d + qW+  +a    + +t+ev+
  CA02g18040 251 ELAVSAMEELTRMAQTDEPMWITNSensiVTLCEEEYARTFPRGITgpkpltLNSEASRASSVVIMNPINLVEILMDAN-QWTSVFAglvsRGMTVEVL 348
                 57899******************9988888899**********999*********************************.******************* PP

                 CTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX CS
       START  86 ssg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp 177
                 s+g      galq+m+ae+q++splvp R+  f+Ry++q+ +g+w++vdvS+ds ++ p  +++ R   +pSg+li++++ng+s+vtwvehv+ +++ +
  CA02g18040 349 STGvagnynGALQVMTAEFQVPSPLVPiRENFFLRYCKQHDDGTWAVVDVSLDSLRPSP-VPPCRR---RPSGCLIKELPNGYSQVTWVEHVEADEKAV 443
                 *********************************************************99.466655...****************************** PP

                 HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
       START 178 hwllrslvksglaegaktwvatlqrqcek 206
                 h ++++lv+sgla+gak+wvatl+rqce+
  CA02g18040 444 HDMYKPLVSSGLAFGAKRWVATLERQCER 472
                 ***************************97 PP

Protein Features
3D Structure
Database           Entry ID             E-value      Start    End    InterPro ID    Description
Gene3D             G3DSA:1.10.10.60     1.3E-21      45       118    IPR009057      Homeodomain-like
SuperFamily        SSF46689             7.52E-18     47       120    IPR009057      Homeodomain-like
PROSITE profile    PS50071              16.018       60       120    IPR001356      Homeobox domain
SMART              SM00389              2.6E-16      61       124    IPR001356      Homeobox domain
Pfam               PF00046              1.9E-16      63       118    IPR001356      Homeobox domain
CDD                cd00086              1.10E-16     63       121    No hit         No description
PROSITE profile    PS50848              46.117       242      475    IPR002913      START domain
SuperFamily        SSF55961             4.17E-35     243      474    No hit         No description
CDD                cd08875              7.07E-119    246      471    No hit         No description
SMART              SM00234              4.9E-56      251      472    IPR002913      START domain
Pfam               PF01852              7.5E-54      252      472    IPR002913      START domain
Gene3D             G3DSA:3.30.530.20    3.1E-6       296      471    IPR023393      START-like domain
SuperFamily        SSF55961             2.34E-25     492      727    No hit         No description
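The feature hits above cluster into a few regions of the protein. As a rough illustration, the sketch below merges overlapping Start/End intervals into consolidated regions. The coordinates are transcribed from the table; the split of the fused Gene3D START-like row into E-value 3.1E-6, Start 296, End 471 is an assumption, and `merge_intervals` and `FEATURES` are hypothetical names introduced here, not part of PlantTFDB.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals; coordinates are 1-based, inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (start, end) pairs transcribed from the Protein Features table.
FEATURES = [
    (45, 118), (47, 120), (60, 120), (61, 124), (63, 118), (63, 121),
    (242, 475), (243, 474), (246, 471), (251, 472), (252, 472),
    (296, 471),  # Gene3D START-like row; this split of the fused digits is assumed
    (492, 727),
]

regions = merge_intervals(FEATURES)
print(regions)
```

Under these assumptions the hits collapse into three regions: one around the homeobox, one around the START domain, and a C-terminal SuperFamily hit.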
Gene Ontology
GO Term       GO Category           GO Description
GO:0003677    Molecular Function    DNA binding
GO:0008289    Molecular Function    lipid binding
Sequence
Protein Sequence    Length: 736 aa
MYKPNMFDSH QHLLDTPSST QKSQETEMDF LREEELESKS GTDIMEGQHS GDDQDPNQRP  60
TKKKRYHRHT QHQIQEMEAF FKECPHPDDK QRKELGRRLE LAPLQVKFWF QNKRTQMKAQ  120
HERCENTHLR NENDKLRAEN IRYKEALTNA SCPHCGGPAA IGEMSFDEQQ LRVENTRLRE  180
EIDRISGIAA KYVGKPMLNF PPHLPPPEAP RSLDLAFGPQ SGLLDEMYNV GDIFRTAIRG  240
LTDGEKPMVI ELAVSAMEEL TRMAQTDEPM WITNSENSIV TLCEEEYART FPRGITGPKP  300
LTLNSEASRA SSVVIMNPIN LVEILMDANQ WTSVFAGLVS RGMTVEVLST GVAGNYNGAL  360
QVMTAEFQVP SPLVPIRENF FLRYCKQHDD GTWAVVDVSL DSLRPSPVPP CRRRPSGCLI  420
KELPNGYSQV TWVEHVEADE KAVHDMYKPL VSSGLAFGAK RWVATLERQC ERLASAMANN  480
IQTGDVGIFT SPAGRKSMLK LAERMVRSFC AGVGTSTTHT WTTLSGSGAD DVRVMTRKSI  540
DDPGRPPGIV LSAATSFWIP VSPKRVFDFL RDENSRSEWD ILSNGGVIQE MAHIANGRDP  600
GNCVSLLRVN SGNSHQSNML ILQESSTDPT GSYVIYAPVD IVAMNVVLSG GDPDYVALLP  660
SGFAILPDGS TNHHGGSGSS SDVGSVGGSL LTVAFQILVD SVPTAKLSLG SVATVNSLIK  720
CTVDRIKSAV TPESA*
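The Start/End coordinates in the Signature Domain table can be checked against this sequence. The sketch below strips the block formatting and slices out the two domains, assuming the coordinates are 1-based and inclusive (which agrees with the alignment shown for the Homeobox domain). `clean_sequence` and `domain` are hypothetical helper names.

```python
import re

# Sequence block copied from this record: groups of 10 residues with a running count.
RAW = """
MYKPNMFDSH QHLLDTPSST QKSQETEMDF LREEELESKS GTDIMEGQHS GDDQDPNQRP  60
TKKKRYHRHT QHQIQEMEAF FKECPHPDDK QRKELGRRLE LAPLQVKFWF QNKRTQMKAQ  120
HERCENTHLR NENDKLRAEN IRYKEALTNA SCPHCGGPAA IGEMSFDEQQ LRVENTRLRE  180
EIDRISGIAA KYVGKPMLNF PPHLPPPEAP RSLDLAFGPQ SGLLDEMYNV GDIFRTAIRG  240
LTDGEKPMVI ELAVSAMEEL TRMAQTDEPM WITNSENSIV TLCEEEYART FPRGITGPKP  300
LTLNSEASRA SSVVIMNPIN LVEILMDANQ WTSVFAGLVS RGMTVEVLST GVAGNYNGAL  360
QVMTAEFQVP SPLVPIRENF FLRYCKQHDD GTWAVVDVSL DSLRPSPVPP CRRRPSGCLI  420
KELPNGYSQV TWVEHVEADE KAVHDMYKPL VSSGLAFGAK RWVATLERQC ERLASAMANN  480
IQTGDVGIFT SPAGRKSMLK LAERMVRSFC AGVGTSTTHT WTTLSGSGAD DVRVMTRKSI  540
DDPGRPPGIV LSAATSFWIP VSPKRVFDFL RDENSRSEWD ILSNGGVIQE MAHIANGRDP  600
GNCVSLLRVN SGNSHQSNML ILQESSTDPT GSYVIYAPVD IVAMNVVLSG GDPDYVALLP  660
SGFAILPDGS TNHHGGSGSS SDVGSVGGSL LTVAFQILVD SVPTAKLSLG SVATVNSLIK  720
CTVDRIKSAV TPESA*
"""


def clean_sequence(raw):
    """Keep residue letters only; drops spaces, running counts and the '*' stop marker."""
    return "".join(re.findall(r"[A-Z]", raw))


def domain(seq, start, end):
    """Return the subsequence for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
homeobox = domain(seq, 63, 118)    # Signature Domain no. 1
start_dom = domain(seq, 251, 472)  # Signature Domain no. 2
print(homeobox)
print(start_dom)
```

The Homeobox slice should reproduce the `CA02g18040` row of the first alignment, which gives a quick sanity check that the table coordinates and the sequence numbering agree.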
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_016560096.1          0.0        PREDICTED: LOW QUALITY PROTEIN: homeobox-leucine zipper protein PROTODERMAL FACTOR 2
Swissprot    Q93V99                  0.0        PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBL       M0ZPP8                  0.0        M0ZPP8_SOLTU; Uncharacterized protein
STRING       PGSC0003DMT400005277    0.0        (Solanum tuberosum)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA932                 24             91
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G04890.1    0.0        protodermal factor 2